Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 336776 |
| Missing cells | 46595 |
| Missing cells (%) | 0.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 48.8 MiB |
| Average record size in memory | 152.0 B |
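The two derived figures in this table can be recomputed from the raw counts. The sketch below is a sanity check only; the variable names are illustrative, and it assumes that "Missing cells (%)" is taken over all rows × columns and that 1 MiB is 1,048,576 bytes.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 19
n_observations = 336_776
missing_cells = 46_595
total_size_mib = 48.8

# Assumption: the percentage is taken over every cell (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The record size lands within about 0.1 B of the reported 152.0 B; the small gap is expected because 48.8 MiB is itself a rounded figure.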
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 13 |
Alerts
| Alert | Type |
|---|---|
| year has constant value "2013" | Constant |
| tailnum has a high cardinality: 4043 distinct values | High cardinality |
| dest has a high cardinality: 105 distinct values | High cardinality |
| time_hour has a high cardinality: 6936 distinct values | High cardinality |
| dep_time is highly correlated with sched_dep_time and 3 other fields | High correlation |
| sched_dep_time is highly correlated with dep_time and 3 other fields | High correlation |
| dep_delay is highly correlated with arr_delay | High correlation |
| arr_time is highly correlated with dep_time and 3 other fields | High correlation |
| sched_arr_time is highly correlated with dep_time and 3 other fields | High correlation |
| arr_delay is highly correlated with dep_delay | High correlation |
| air_time is highly correlated with distance | High correlation |
| distance is highly correlated with air_time | High correlation |
| hour is highly correlated with dep_time and 3 other fields | High correlation |
| carrier is highly correlated with year and 1 other field | High correlation |
| year is highly correlated with carrier and 1 other field | High correlation |
| origin is highly correlated with carrier and 1 other field | High correlation |
| carrier is highly correlated with flight and 3 other fields | High correlation |
| flight is highly correlated with carrier and 2 other fields | High correlation |
| air_time is highly correlated with carrier and 2 other fields | High correlation |
| distance is highly correlated with carrier and 1 other field | High correlation |
| dep_time has 8255 (2.5%) missing values | Missing |
| dep_delay has 8255 (2.5%) missing values | Missing |
| arr_time has 8713 (2.6%) missing values | Missing |
| arr_delay has 9430 (2.8%) missing values | Missing |
| air_time has 9430 (2.8%) missing values | Missing |
| dep_delay has 16514 (4.9%) zeros | Zeros |
| arr_delay has 5409 (1.6%) zeros | Zeros |
| minute has 60696 (18.0%) zeros | Zeros |
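The percentages quoted in the missing-value and zero alerts can be reproduced from the counts and the 336776 observations. A minimal sketch, assuming each percentage is simply count / observations rounded to one decimal place (the helper name `pct` is illustrative):

```python
n_observations = 336_776

# (column, count, percentage reported in the alerts above)
missing = [
    ("dep_time", 8255, 2.5),
    ("dep_delay", 8255, 2.5),
    ("arr_time", 8713, 2.6),
    ("arr_delay", 9430, 2.8),
    ("air_time", 9430, 2.8),
]
zeros = [
    ("dep_delay", 16514, 4.9),
    ("arr_delay", 5409, 1.6),
    ("minute", 60696, 18.0),
]


def pct(count, total=n_observations):
    """Share of `total` as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


for kind, rows in (("missing", missing), ("zeros", zeros)):
    for column, count, reported in rows:
        print(f"{column:<10} {kind:<8} {pct(count):>5.1f}%  (reported {reported}%)")
```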
Reproduction
| Analysis started | 2022-03-09 03:59:05.575851 |
|---|---|
| Analysis finished | 2022-03-09 03:59:39.157408 |
| Duration | 33.58 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
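The reported duration is the difference between the two timestamps. A short check using only the standard library (the format string is an assumption about how the timestamps are laid out, inferred from the values shown):

```python
from datetime import datetime

# Timestamp layout inferred from the "Reproduction" table above.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2022-03-09 03:59:05.575851", FMT)
finished = datetime.strptime("2022-03-09 03:59:39.157408", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```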
year
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| 2013 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2013 |
|---|---|
| 2nd row | 2013 |
| 3rd row | 2013 |
| 4th row | 2013 |
| 5th row | 2013 |
Common Values
| Value | Count | Frequency (%) |
| 2013 | 336776 | 100.0% |
month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.548509989 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.414457245 |
|---|---|
| Coefficient of variation (CV) | 0.5214097941 |
| Kurtosis | -1.186950055 |
| Mean | 6.548509989 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.01339988513 |
| Sum | 2205381 |
| Variance | 11.65851828 |
| Monotonicity | Not monotonic |
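Several of the descriptive statistics above are tied together by simple identities, which makes them easy to cross-check. A sketch, assuming the mean is taken over all 336776 rows (month has no missing values) and allowing for the rounding of the printed figures:

```python
import math

# Figures copied from the month tables above.
n = 336_776          # observations, none missing for month
total = 2_205_381    # Sum
mean = 6.548509989
std = 3.414457245
variance = 11.65851828
cv = 0.5214097941

# Tolerance covers the rounding of the printed values.
checks = {
    "mean == sum / n": math.isclose(mean, total / n, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(variance, std ** 2, rel_tol=1e-6),
    "cv == std / mean": math.isclose(cv, std / mean, rel_tol=1e-6),
}

for name, ok in checks.items():
    print(f"{name}: {ok}")
```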
| Value | Count | Frequency (%) |
| 7 | 29425 | 8.7% |
| 8 | 29327 | 8.7% |
| 10 | 28889 | 8.6% |
| 3 | 28834 | 8.6% |
| 5 | 28796 | 8.6% |
| 4 | 28330 | 8.4% |
| 6 | 28243 | 8.4% |
| 12 | 28135 | 8.4% |
| 9 | 27574 | 8.2% |
| 11 | 27268 | 8.1% |
| Other values (2) | 51955 | 15.4% |
| Value | Count | Frequency (%) |
| 1 | 27004 | 8.0% |
| 2 | 24951 | 7.4% |
| 3 | 28834 | 8.6% |
| 4 | 28330 | 8.4% |
| 5 | 28796 | 8.6% |
| 6 | 28243 | 8.4% |
| 7 | 29425 | 8.7% |
| 8 | 29327 | 8.7% |
| 9 | 27574 | 8.2% |
| 10 | 28889 | 8.6% |
| Value | Count | Frequency (%) |
| 12 | 28135 | 8.4% |
| 11 | 27268 | 8.1% |
| 10 | 28889 | 8.6% |
| 9 | 27574 | 8.2% |
| 8 | 29327 | 8.7% |
| 7 | 29425 | 8.7% |
| 6 | 28243 | 8.4% |
| 5 | 28796 | 8.6% |
| 4 | 28330 | 8.4% |
| 3 | 28834 | 8.6% |
day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.71078699 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.768607102 |
|---|---|
| Coefficient of variation (CV) | 0.5581265347 |
| Kurtosis | -1.185945406 |
| Mean | 15.71078699 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.007744499321 |
| Sum | 5291016 |
| Variance | 76.8884705 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 11399 | 3.4% |
| 11 | 11359 | 3.4% |
| 22 | 11345 | 3.4% |
| 15 | 11317 | 3.4% |
| 8 | 11271 | 3.3% |
| 10 | 11227 | 3.3% |
| 17 | 11222 | 3.3% |
| 3 | 11211 | 3.3% |
| 21 | 11141 | 3.3% |
| 20 | 11111 | 3.3% |
| Other values (21) | 224173 |
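The "Other values" row is the remainder after the ten most common values, so it can be checked against the total number of observations, and its missing frequency can be derived the same way. A minimal sketch using the counts from the table above:

```python
n_observations = 336_776

# Ten most common day values and their counts, copied from the table above.
top10 = {
    18: 11399, 11: 11359, 22: 11345, 15: 11317, 8: 11271,
    10: 11227, 17: 11222, 3: 11211, 21: 11141, 20: 11111,
}
other_reported = 224_173

# The remainder should equal the reported "Other values (21)" count.
other = n_observations - sum(top10.values())
other_pct = 100 * other / n_observations

print(f"Remainder matches reported count: {other == other_reported}")
print(f"Other values (21): {other_pct:.1f}%")
```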
| Value | Count | Frequency (%) |
| 1 | 11036 | |
| 2 | 10808 | |
| 3 | 11211 | |
| 4 | 11059 | |
| 5 | 10858 | |
| 6 | 11059 | |
| 7 | 10985 | |
| 8 | 11271 | |
| 9 | 10857 | |
| 10 | 11227 |
| Value | Count | Frequency (%) |
| 31 | 6190 | |
| 30 | 10289 | |
| 29 | 10039 | |
| 28 | 10773 | |
| 27 | 11084 | |
| 26 | 10883 | |
| 25 | 11097 | |
| 24 | 11041 | |
| 23 | 10966 | |
| 22 | 11345 |
dep_time
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1318 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 8255 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1349.109947 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 624 |
| Q1 | 907 |
| median | 1401 |
| Q3 | 1744 |
| 95-th percentile | 2112 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 837 |
Descriptive statistics
| Standard deviation | 488.281791 |
|---|---|
| Coefficient of variation (CV) | 0.3619288346 |
| Kurtosis | -1.088319991 |
| Mean | 1349.109947 |
| Median Absolute Deviation (MAD) | 428 |
| Skewness | -0.02474345303 |
| Sum | 443210949 |
| Variance | 238419.1074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 555 | 834 | 0.2% |
| 755 | 820 | 0.2% |
| 556 | 818 | 0.2% |
| 557 | 799 | 0.2% |
| 655 | 798 | 0.2% |
| 1455 | 774 | 0.2% |
| 1454 | 769 | 0.2% |
| 654 | 751 | 0.2% |
| 855 | 743 | 0.2% |
| 754 | 742 | 0.2% |
| Other values (1308) | 320673 | |
| (Missing) | 8255 | 2.5% |
| Value | Count | Frequency (%) |
| 1 | 25 | |
| 2 | 35 | |
| 3 | 26 | |
| 4 | 26 | |
| 5 | 21 | |
| 6 | 22 | |
| 7 | 22 | |
| 8 | 23 | |
| 9 | 28 | |
| 10 | 22 |
| Value | Count | Frequency (%) |
| 2400 | 29 | < 0.1% |
| 2359 | 55 | |
| 2358 | 76 | |
| 2357 | 74 | |
| 2356 | 74 | |
| 2355 | 82 | |
| 2354 | 69 | |
| 2353 | 68 | |
| 2352 | 68 | |
| 2351 | 57 |
sched_dep_time
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 1021 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1344.25484 |
| Minimum | 106 |
|---|---|
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 106 |
|---|---|
| 5-th percentile | 630 |
| Q1 | 906 |
| median | 1359 |
| Q3 | 1729 |
| 95-th percentile | 2050 |
| Maximum | 2359 |
| Range | 2253 |
| Interquartile range (IQR) | 823 |
Descriptive statistics
| Standard deviation | 467.3357557 |
|---|---|
| Coefficient of variation (CV) | 0.3476541366 |
| Kurtosis | -1.197903099 |
| Mean | 1344.25484 |
| Median Absolute Deviation (MAD) | 414 |
| Skewness | -0.00585808289 |
| Sum | 452712768 |
| Variance | 218402.7086 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 600 | 7016 | 2.1% |
| 700 | 4900 | 1.5% |
| 630 | 4770 | 1.4% |
| 900 | 4766 | 1.4% |
| 1200 | 4624 | 1.4% |
| 1700 | 4526 | 1.3% |
| 1600 | 4098 | 1.2% |
| 800 | 3926 | 1.2% |
| 1300 | 3689 | 1.1% |
| 1900 | 3653 | 1.1% |
| Other values (1011) | 290808 |
| Value | Count | Frequency (%) |
| 106 | 1 | < 0.1% |
| 500 | 341 | |
| 501 | 1 | < 0.1% |
| 505 | 2 | < 0.1% |
| 510 | 5 | < 0.1% |
| 515 | 208 | |
| 516 | 4 | < 0.1% |
| 517 | 28 | < 0.1% |
| 520 | 7 | < 0.1% |
| 525 | 37 | < 0.1% |
| Value | Count | Frequency (%) |
| 2359 | 828 | |
| 2358 | 44 | < 0.1% |
| 2355 | 73 | < 0.1% |
| 2352 | 16 | < 0.1% |
| 2345 | 1 | < 0.1% |
| 2339 | 1 | < 0.1% |
| 2330 | 14 | < 0.1% |
| 2315 | 1 | < 0.1% |
| 2305 | 61 | < 0.1% |
| 2300 | 22 | < 0.1% |
dep_delay
Real number (ℝ)
| Distinct | 527 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 8255 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.63907026 |
| Minimum | -43 |
|---|---|
| Maximum | 1301 |
| Zeros | 16514 |
| Zeros (%) | 4.9% |
| Negative | 183575 |
| Negative (%) | 54.5% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | -43 |
|---|---|
| 5-th percentile | -9 |
| Q1 | -5 |
| median | -2 |
| Q3 | 11 |
| 95-th percentile | 88 |
| Maximum | 1301 |
| Range | 1344 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 40.21006089 |
|---|---|
| Coefficient of variation (CV) | 3.181409714 |
| Kurtosis | 43.95011603 |
| Mean | 12.63907026 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 4.802540511 |
| Sum | 4152200 |
| Variance | 1616.848997 |
| Monotonicity | Not monotonic |
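The range and IQR above follow directly from the quantiles, and the same quantiles give a rough sense of how heavy the right tail is. The sketch below applies Tukey's 1.5 × IQR fences to the departure-delay quantiles (the column whose 8255 missing values and 16514 zeros match the dep_delay alerts). The fences are an illustration added here, not something the report itself computes.

```python
# Quantiles copied from the table above.
minimum, q1, q3, maximum = -43, -5, 11, 1301
p95 = 88

iqr = q3 - q1
value_range = maximum - minimum

# Tukey's rule (an assumption for illustration, not part of the report):
# values beyond 1.5 * IQR from the quartiles are flagged as potential outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(f"IQR = {iqr}, range = {value_range}")
print(f"Tukey fences: [{lower_fence}, {upper_fence}]")
print(f"95th percentile above upper fence: {p95 > upper_fence}")
```

Because the 95th percentile (88) already lies beyond the upper fence, at least 5% of recorded values would be flagged by this rule, which is consistent with the high skewness and kurtosis reported above.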
| Value | Count | Frequency (%) |
| -5 | 24821 | 7.4% |
| -4 | 24619 | 7.3% |
| -3 | 24218 | 7.2% |
| -2 | 21516 | 6.4% |
| -6 | 20701 | 6.1% |
| -1 | 18813 | 5.6% |
| -7 | 16752 | 5.0% |
| 0 | 16514 | 4.9% |
| -8 | 11791 | 3.5% |
| 1 | 8050 | 2.4% |
| Other values (517) | 140726 | |
| (Missing) | 8255 | 2.5% |
| Value | Count | Frequency (%) |
| -43 | 1 | < 0.1% |
| -33 | 1 | < 0.1% |
| -32 | 1 | < 0.1% |
| -30 | 1 | < 0.1% |
| -27 | 1 | < 0.1% |
| -26 | 1 | < 0.1% |
| -25 | 2 | < 0.1% |
| -24 | 4 | < 0.1% |
| -23 | 6 | |
| -22 | 11 |
| Value | Count | Frequency (%) |
| 1301 | 1 | |
| 1137 | 1 | |
| 1126 | 1 | |
| 1014 | 1 | |
| 1005 | 1 | |
| 960 | 1 | |
| 911 | 1 | |
| 899 | 1 | |
| 898 | 1 | |
| 896 | 1 |
arr_time
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1411 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 8713 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1502.054999 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 736 |
| Q1 | 1104 |
| median | 1535 |
| Q3 | 1940 |
| 95-th percentile | 2248 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 836 |
Descriptive statistics
| Standard deviation | 533.264132 |
|---|---|
| Coefficient of variation (CV) | 0.3550230401 |
| Kurtosis | -0.1926343839 |
| Mean | 1502.054999 |
| Median Absolute Deviation (MAD) | 418 |
| Skewness | -0.4678190642 |
| Sum | 492768669 |
| Variance | 284370.6345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1008 | 485 | 0.1% |
| 1013 | 484 | 0.1% |
| 1015 | 479 | 0.1% |
| 1012 | 464 | 0.1% |
| 1005 | 460 | 0.1% |
| 1016 | 459 | 0.1% |
| 1006 | 459 | 0.1% |
| 1011 | 457 | 0.1% |
| 1007 | 456 | 0.1% |
| 1040 | 455 | 0.1% |
| Other values (1401) | 323405 | |
| (Missing) | 8713 | 2.6% |
| Value | Count | Frequency (%) |
| 1 | 201 | |
| 2 | 164 | |
| 3 | 174 | |
| 4 | 173 | |
| 5 | 206 | |
| 6 | 148 | |
| 7 | 170 | |
| 8 | 147 | |
| 9 | 140 | |
| 10 | 178 |
| Value | Count | Frequency (%) |
| 2400 | 150 | |
| 2359 | 222 | |
| 2358 | 189 | |
| 2357 | 207 | |
| 2356 | 202 | |
| 2355 | 206 | |
| 2354 | 195 | |
| 2353 | 182 | |
| 2352 | 193 | |
| 2351 | 216 |
sched_arr_time
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 1163 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1536.38022 |
| Minimum | 1 |
|---|---|
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 815 |
| Q1 | 1124 |
| median | 1556 |
| Q3 | 1945 |
| 95-th percentile | 2246 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 821 |
Descriptive statistics
| Standard deviation | 497.4571415 |
|---|---|
| Coefficient of variation (CV) | 0.323785177 |
| Kurtosis | -0.3822477902 |
| Mean | 1536.38022 |
| Median Absolute Deviation (MAD) | 417 |
| Skewness | -0.3531380695 |
| Sum | 517415985 |
| Variance | 247463.6076 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1025 | 1324 | 0.4% |
| 2015 | 1234 | 0.4% |
| 1110 | 1198 | 0.4% |
| 1115 | 1193 | 0.4% |
| 1235 | 1133 | 0.3% |
| 2359 | 1121 | 0.3% |
| 1815 | 1111 | 0.3% |
| 1015 | 1080 | 0.3% |
| 1645 | 1079 | 0.3% |
| 1220 | 1073 | 0.3% |
| Other values (1153) | 325230 |
| Value | Count | Frequency (%) |
| 1 | 243 | |
| 2 | 95 | < 0.1% |
| 3 | 159 | |
| 4 | 107 | |
| 5 | 82 | < 0.1% |
| 6 | 19 | < 0.1% |
| 7 | 85 | < 0.1% |
| 8 | 154 | |
| 9 | 55 | < 0.1% |
| 10 | 72 | < 0.1% |
| Value | Count | Frequency (%) |
| 2359 | 1121 | |
| 2358 | 483 | |
| 2357 | 349 | 0.1% |
| 2356 | 468 | |
| 2355 | 335 | 0.1% |
| 2354 | 384 | 0.1% |
| 2353 | 263 | 0.1% |
| 2352 | 47 | < 0.1% |
| 2351 | 140 | < 0.1% |
| 2350 | 105 | < 0.1% |
arr_delay
Real number (ℝ)
| Distinct | 577 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9430 |
| Missing (%) | 2.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.895376757 |
| Minimum | -86 |
|---|---|
| Maximum | 1272 |
| Zeros | 5409 |
| Zeros (%) | 1.6% |
| Negative | 188933 |
| Negative (%) | 56.1% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | -86 |
|---|---|
| 5-th percentile | -32 |
| Q1 | -17 |
| median | -5 |
| Q3 | 14 |
| 95-th percentile | 91 |
| Maximum | 1272 |
| Range | 1358 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 44.63329169 |
|---|---|
| Coefficient of variation (CV) | 6.472930089 |
| Kurtosis | 29.233044 |
| Mean | 6.895376757 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 3.71681748 |
| Sum | 2257174 |
| Variance | 1992.130727 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -13 | 7177 | 2.1% |
| -10 | 7088 | 2.1% |
| -12 | 7046 | 2.1% |
| -14 | 6975 | 2.1% |
| -11 | 6863 | 2.0% |
| -9 | 6815 | 2.0% |
| -15 | 6796 | 2.0% |
| -7 | 6677 | 2.0% |
| -17 | 6668 | 2.0% |
| -8 | 6663 | 2.0% |
| Other values (567) | 258578 | |
| (Missing) | 9430 | 2.8% |
| Value | Count | Frequency (%) |
| -86 | 1 | < 0.1% |
| -79 | 1 | < 0.1% |
| -75 | 2 | < 0.1% |
| -74 | 1 | < 0.1% |
| -73 | 1 | < 0.1% |
| -71 | 3 | < 0.1% |
| -70 | 8 | |
| -69 | 7 | |
| -68 | 12 | |
| -67 | 7 |
| Value | Count | Frequency (%) |
| 1272 | 1 | |
| 1127 | 1 | |
| 1109 | 1 | |
| 1007 | 1 | |
| 989 | 1 | |
| 931 | 1 | |
| 915 | 1 | |
| 895 | 1 | |
| 878 | 1 | |
| 875 | 1 |
carrier
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| UA | 58665 |
|---|---|
| B6 | 54635 |
| EV | 54173 |
| DL | 48110 |
| AA | 32729 |
| Other values (11) | 88464 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UA |
|---|---|
| 2nd row | UA |
| 3rd row | AA |
| 4th row | B6 |
| 5th row | DL |
Common Values
| Value | Count | Frequency (%) |
| UA | 58665 | 17.4% |
| B6 | 54635 | 16.2% |
| EV | 54173 | 16.1% |
| DL | 48110 | 14.3% |
| AA | 32729 | 9.7% |
| MQ | 26397 | 7.8% |
| US | 20536 | 6.1% |
| 9E | 18460 | 5.5% |
| WN | 12275 | 3.6% |
| VX | 5162 | 1.5% |
| Other values (6) | 5634 | 1.7% |
flight
Real number (ℝ≥0)
| Distinct | 3844 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.92362 |
| Minimum | 1 |
|---|---|
| Maximum | 8500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 91 |
| Q1 | 553 |
| median | 1496 |
| Q3 | 3465 |
| 95-th percentile | 4695 |
| Maximum | 8500 |
| Range | 8499 |
| Interquartile range (IQR) | 2912 |
Descriptive statistics
| Standard deviation | 1632.471938 |
|---|---|
| Coefficient of variation (CV) | 0.8278575913 |
| Kurtosis | -0.8485606835 |
| Mean | 1971.92362 |
| Median Absolute Deviation (MAD) | 1085 |
| Skewness | 0.6616036349 |
| Sum | 664096549 |
| Variance | 2664964.629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 968 | 0.3% |
| 27 | 898 | 0.3% |
| 181 | 882 | 0.3% |
| 301 | 871 | 0.3% |
| 161 | 786 | 0.2% |
| 695 | 782 | 0.2% |
| 1109 | 716 | 0.2% |
| 745 | 711 | 0.2% |
| 359 | 709 | 0.2% |
| 1 | 701 | 0.2% |
| Other values (3834) | 328752 |
| Value | Count | Frequency (%) |
| 1 | 701 | |
| 2 | 51 | < 0.1% |
| 3 | 631 | |
| 4 | 393 | |
| 5 | 324 | |
| 6 | 210 | 0.1% |
| 7 | 237 | 0.1% |
| 8 | 236 | 0.1% |
| 9 | 153 | < 0.1% |
| 10 | 61 | < 0.1% |
| Value | Count | Frequency (%) |
| 8500 | 1 | < 0.1% |
| 6181 | 80 | |
| 6180 | 6 | < 0.1% |
| 6177 | 164 | |
| 6171 | 1 | < 0.1% |
| 6168 | 2 | < 0.1% |
| 6167 | 3 | < 0.1% |
| 6165 | 1 | < 0.1% |
| 6140 | 1 | < 0.1% |
| 6138 | 2 | < 0.1% |
tailnum
Categorical
| Distinct | 4043 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 2512 |
| Missing (%) | 0.7% |
| Memory size | 2.6 MiB |
| N725MQ | 575 |
|---|---|
| N722MQ | 513 |
| N723MQ | 507 |
| N711MQ | 486 |
| N713MQ | 483 |
| Other values (4038) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.995222339 |
| Min length | 5 |
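A mean length just below 6, with a minimum of 5 and a maximum of 6, implies that a small number of tail numbers have five characters. The sketch below backs out that count. It assumes every non-missing value is either 5 or 6 characters long (consistent with the min and max above) and that the mean is taken over non-missing values only; neither assumption is stated in the report.

```python
# Figures copied from the tables above.
n_observations = 336_776
n_missing = 2_512
mean_length = 5.995222339

n_present = n_observations - n_missing

# With only lengths 5 and 6 possible:
#   mean = (5 * n_five + 6 * (n_present - n_five)) / n_present
#        = 6 - n_five / n_present
n_five = (6 - mean_length) * n_present

print(f"Non-missing values: {n_present}")
print(f"Implied five-character values: {n_five:.2f}")
```

The result comes out almost exactly at a whole number, which suggests the two assumptions hold for this column.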
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 171 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | N14228 |
|---|---|
| 2nd row | N24211 |
| 3rd row | N619AA |
| 4th row | N804JB |
| 5th row | N668DN |
Common Values
| Value | Count | Frequency (%) |
| N725MQ | 575 | 0.2% |
| N722MQ | 513 | 0.2% |
| N723MQ | 507 | 0.2% |
| N711MQ | 486 | 0.1% |
| N713MQ | 483 | 0.1% |
| N258JB | 427 | 0.1% |
| N298JB | 407 | 0.1% |
| N353JB | 404 | 0.1% |
| N351JB | 402 | 0.1% |
| N735MQ | 396 | 0.1% |
| Other values (4033) | 329664 | |
| (Missing) | 2512 | 0.7% |
origin
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| EWR | 120835 |
|---|---|
| JFK | 111279 |
| LGA | 104662 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EWR |
|---|---|
| 2nd row | LGA |
| 3rd row | JFK |
| 4th row | JFK |
| 5th row | LGA |
Common Values
| Value | Count | Frequency (%) |
| EWR | 120835 | 35.9% |
| JFK | 111279 | 33.0% |
| LGA | 104662 | 31.1% |
dest
Categorical
| Distinct | 105 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| ORD | 17283 |
|---|---|
| ATL | 17215 |
| LAX | 16174 |
| BOS | 15508 |
| MCO | 14082 |
| Other values (100) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | IAH |
|---|---|
| 2nd row | IAH |
| 3rd row | MIA |
| 4th row | BQN |
| 5th row | ATL |
Common Values
| Value | Count | Frequency (%) |
| ORD | 17283 | 5.1% |
| ATL | 17215 | 5.1% |
| LAX | 16174 | 4.8% |
| BOS | 15508 | 4.6% |
| MCO | 14082 | 4.2% |
| CLT | 14064 | 4.2% |
| SFO | 13331 | 4.0% |
| FLL | 12055 | 3.6% |
| MIA | 11728 | 3.5% |
| DCA | 9705 | 2.9% |
| Other values (95) | 195631 |
air_time
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 509 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9430 |
| Missing (%) | 2.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 150.6864602 |
| Minimum | 20 |
|---|---|
| Maximum | 695 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 40 |
| Q1 | 82 |
| median | 129 |
| Q3 | 192 |
| 95-th percentile | 339 |
| Maximum | 695 |
| Range | 675 |
| Interquartile range (IQR) | 110 |
Descriptive statistics
| Standard deviation | 93.68830466 |
|---|---|
| Coefficient of variation (CV) | 0.6217433506 |
| Kurtosis | 0.8630769908 |
| Mean | 150.6864602 |
| Median Absolute Deviation (MAD) | 51 |
| Skewness | 1.070705186 |
| Sum | 49326610 |
| Variance | 8777.49843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42 | 2552 | 0.8% |
| 43 | 2543 | 0.8% |
| 41 | 2513 | 0.7% |
| 45 | 2495 | 0.7% |
| 40 | 2466 | 0.7% |
| 44 | 2444 | 0.7% |
| 39 | 2411 | 0.7% |
| 47 | 2409 | 0.7% |
| 46 | 2406 | 0.7% |
| 109 | 2377 | 0.7% |
| Other values (499) | 302730 | |
| (Missing) | 9430 | 2.8% |
| Value | Count | Frequency (%) |
| 20 | 2 | < 0.1% |
| 21 | 14 | < 0.1% |
| 22 | 34 | < 0.1% |
| 23 | 82 | < 0.1% |
| 24 | 103 | |
| 25 | 124 | |
| 26 | 169 | |
| 27 | 147 | |
| 28 | 180 | |
| 29 | 209 |
| Value | Count | Frequency (%) |
| 695 | 1 | |
| 691 | 1 | |
| 686 | 2 | |
| 683 | 1 | |
| 679 | 1 | |
| 676 | 2 | |
| 675 | 1 | |
| 671 | 2 | |
| 669 | 1 | |
| 667 | 2 |
distance
Real number (ℝ≥0)
| Distinct | 214 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1039.912604 |
| Minimum | 17 |
|---|---|
| Maximum | 4983 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 502 |
| median | 872 |
| Q3 | 1389 |
| 95-th percentile | 2475 |
| Maximum | 4983 |
| Range | 4966 |
| Interquartile range (IQR) | 887 |
Descriptive statistics
| Standard deviation | 733.2330333 |
|---|---|
| Coefficient of variation (CV) | 0.7050910151 |
| Kurtosis | 1.193639906 |
| Mean | 1039.912604 |
| Median Absolute Deviation (MAD) | 384 |
| Skewness | 1.128690151 |
| Sum | 350217607 |
| Variance | 537630.6812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2475 | 11262 | 3.3% |
| 762 | 10263 | 3.0% |
| 733 | 8857 | 2.6% |
| 2586 | 8204 | 2.4% |
| 544 | 6168 | 1.8% |
| 719 | 6100 | 1.8% |
| 187 | 5898 | 1.8% |
| 1096 | 5781 | 1.7% |
| 2454 | 5695 | 1.7% |
| 184 | 5504 | 1.6% |
| Other values (204) | 263044 |
| Value | Count | Frequency (%) |
| 17 | 1 | < 0.1% |
| 80 | 49 | < 0.1% |
| 94 | 976 | 0.3% |
| 96 | 607 | 0.2% |
| 116 | 443 | 0.1% |
| 143 | 439 | 0.1% |
| 160 | 376 | 0.1% |
| 169 | 545 | 0.2% |
| 173 | 221 | 0.1% |
| 184 | 5504 |
| Value | Count | Frequency (%) |
| 4983 | 342 | 0.1% |
| 4963 | 365 | 0.1% |
| 3370 | 8 | < 0.1% |
| 2586 | 8204 | |
| 2576 | 312 | 0.1% |
| 2569 | 329 | 0.1% |
| 2565 | 5127 | |
| 2521 | 284 | 0.1% |
| 2475 | 11262 | |
| 2465 | 1039 | 0.3% |
hour
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.1802474 |
| Minimum | 1 |
|---|---|
| Maximum | 23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 13 |
| Q3 | 17 |
| 95-th percentile | 20 |
| Maximum | 23 |
| Range | 22 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.661315708 |
|---|---|
| Coefficient of variation (CV) | 0.3536591966 |
| Kurtosis | -1.206416089 |
| Mean | 13.1802474 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.0005426517817 |
| Sum | 4438791 |
| Variance | 21.72786413 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 27242 | 8.1% |
| 6 | 25951 | 7.7% |
| 17 | 24426 | 7.3% |
| 15 | 23888 | 7.1% |
| 16 | 23002 | 6.8% |
| 7 | 22821 | 6.8% |
| 18 | 21783 | 6.5% |
| 14 | 21706 | 6.4% |
| 19 | 21441 | 6.4% |
| 9 | 20312 | 6.0% |
| Other values (10) | 104204 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 5 | 1953 | 0.6% |
| 6 | 25951 | |
| 7 | 22821 | |
| 8 | 27242 | |
| 9 | 20312 | |
| 10 | 16708 | |
| 11 | 16033 | |
| 12 | 18181 | |
| 13 | 19956 |
| Value | Count | Frequency (%) |
| 23 | 1061 | 0.3% |
| 22 | 2639 | 0.8% |
| 21 | 10933 | |
| 20 | 16739 | |
| 19 | 21441 | |
| 18 | 21783 | |
| 17 | 24426 | |
| 16 | 23002 | |
| 15 | 23888 | |
| 14 | 21706 |
minute
Real number (ℝ≥0)
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.23009953 |
| Minimum | 0 |
|---|---|
| Maximum | 59 |
| Zeros | 60696 |
| Zeros (%) | 18.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 8 |
| median | 29 |
| Q3 | 44 |
| 95-th percentile | 58 |
| Maximum | 59 |
| Range | 59 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 19.30084566 |
|---|---|
| Coefficient of variation (CV) | 0.7358281517 |
| Kurtosis | -1.235018012 |
| Mean | 26.23009953 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.09293094675 |
| Sum | 8833668 |
| Variance | 372.5226431 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 60696 | |
| 30 | 33899 | 10.1% |
| 45 | 20398 | 6.1% |
| 15 | 18868 | 5.6% |
| 55 | 18834 | 5.6% |
| 59 | 16288 | 4.8% |
| 10 | 14503 | 4.3% |
| 25 | 14450 | 4.3% |
| 5 | 14118 | 4.2% |
| 29 | 13823 | 4.1% |
| Other values (50) | 110899 | 32.9% |
| Value | Count | Frequency (%) |
| 0 | 60696 | 18.0% |
| 1 | 2116 | 0.6% |
| 2 | 848 | 0.3% |
| 3 | 1439 | 0.4% |
| 4 | 1357 | 0.4% |
| 5 | 14118 | 4.2% |
| 6 | 1381 | 0.4% |
| 7 | 1092 | 0.3% |
| 8 | 1695 | 0.5% |
| 9 | 1445 | 0.4% |
| Value | Count | Frequency (%) |
| 59 | 16288 | 4.8% |
| 58 | 1065 | 0.3% |
| 57 | 1388 | 0.4% |
| 56 | 1713 | 0.5% |
| 55 | 18834 | 5.6% |
| 54 | 1405 | 0.4% |
| 53 | 1382 | 0.4% |
| 52 | 1281 | 0.4% |
| 51 | 1184 | 0.4% |
| 50 | 12508 | 3.7% |
| Distinct | 6936 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| 2013-09-13T12:00:00Z | 94 |
|---|---|
| 2013-09-20T12:00:00Z | 94 |
| 2013-09-09T12:00:00Z | 93 |
| 2013-09-16T12:00:00Z | 93 |
| 2013-09-23T12:00:00Z | 93 |
| Other values (6931) | 336309 |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 52 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2013-01-01T10:00:00Z |
|---|---|
| 2nd row | 2013-01-01T10:00:00Z |
| 3rd row | 2013-01-01T10:00:00Z |
| 4th row | 2013-01-01T10:00:00Z |
| 5th row | 2013-01-01T11:00:00Z |
Common Values
| Value | Count | Frequency (%) |
| 2013-09-13T12:00:00Z | 94 | < 0.1% |
| 2013-09-20T12:00:00Z | 94 | < 0.1% |
| 2013-09-09T12:00:00Z | 93 | < 0.1% |
| 2013-09-16T12:00:00Z | 93 | < 0.1% |
| 2013-09-23T12:00:00Z | 93 | < 0.1% |
| 2013-09-19T12:00:00Z | 92 | < 0.1% |
| 2013-10-11T12:00:00Z | 92 | < 0.1% |
| 2013-09-10T12:00:00Z | 91 | < 0.1% |
| 2013-10-09T12:00:00Z | 91 | < 0.1% |
| 2013-09-12T12:00:00Z | 91 | < 0.1% |
| Other values (6926) | 335852 | 99.7% |
Length
| Value | Count | Frequency (%) |
| 2013-09-13t12:00:00z | 94 | < 0.1% |
| 2013-09-20t12:00:00z | 94 | < 0.1% |
| 2013-09-09t12:00:00z | 93 | < 0.1% |
| 2013-09-23t12:00:00z | 93 | < 0.1% |
| 2013-09-16t12:00:00z | 93 | < 0.1% |
| 2013-09-19t12:00:00z | 92 | < 0.1% |
| 2013-10-11t12:00:00z | 92 | < 0.1% |
| 2013-09-24t12:00:00z | 91 | < 0.1% |
| 2013-10-01t12:00:00z | 91 | < 0.1% |
| 2013-09-18t12:00:00z | 91 | < 0.1% |
| Other values (6926) | 335852 | 99.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
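As a minimal sketch of that definition, the following computes ρ as the Pearson correlation of the rank variables, using average ranks for ties. The function names `average_ranks`, `pearson`, and `spearman` are illustrative and are not taken from the report's implementation.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at sorted position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def spearman(x, y):
    """Pearson correlation applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x^3.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print("Pearson r :", pearson(x, y))
print("Spearman ρ:", spearman(x, y))
```

On this toy input the ranks of x and y coincide, so ρ is 1 up to floating-point rounding, while r stays below 1 because the relationship is not linear.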
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for points lying on a straight line the steepness of that line does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
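The invariance property can be checked numerically. The sketch below implements r directly from the covariance and standard deviations and compares it before and after shifting and rescaling both variables; `pearson_r` and the sample data are illustrative only, and the scale factors are assumed to be positive (a negative factor flips the sign of r).

```python
from math import sqrt

def pearson_r(x, y):
    """r = cov(X, Y) / (std(X) * std(Y)), using population moments."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

x = [1.0, 2.0, 4.0, 7.0, 11.0]
y = [2.0, 1.0, 5.0, 6.0, 12.0]

r_original = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson_r([3 * a + 10 for a in x], [0.5 * b - 4 for b in y])

# Points exactly on a line give r = 1 whether the line is steep or shallow.
r_steep = pearson_r(x, [10 * a for a in x])
r_shallow = pearson_r(x, [0.1 * a for a in x])

print(r_original, r_rescaled, r_steep, r_shallow)
```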
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
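The pair-counting definition above can be sketched as follows. This is the plain form of the formula as stated in the text; library implementations commonly use a tie-adjusted variant, and the report does not say which one produced its values. The name `kendall_tau` is illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, with no adjustment for ties."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y order the two observations the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]   # two adjacent swaps relative to x
tau = kendall_tau(x, y)
print("τ =", tau)
```

Of the 10 pairs in this toy input, 8 are concordant and 2 are discordant, so τ = (8 - 2) / 10 = 0.6. The quadratic loop is fine for illustration but would be slow on a dataset of this report's size.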
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | year | month | day | dep_time | sched_dep_time | dep_delay | arr_time | sched_arr_time | arr_delay | carrier | flight | tailnum | origin | dest | air_time | distance | hour | minute | time_hour |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2013 | 1 | 1 | 517.0 | 515 | 2.0 | 830.0 | 819 | 11.0 | UA | 1545 | N14228 | EWR | IAH | 227.0 | 1400 | 5 | 15 | 2013-01-01T10:00:00Z |
| 1 | 2013 | 1 | 1 | 533.0 | 529 | 4.0 | 850.0 | 830 | 20.0 | UA | 1714 | N24211 | LGA | IAH | 227.0 | 1416 | 5 | 29 | 2013-01-01T10:00:00Z |
| 2 | 2013 | 1 | 1 | 542.0 | 540 | 2.0 | 923.0 | 850 | 33.0 | AA | 1141 | N619AA | JFK | MIA | 160.0 | 1089 | 5 | 40 | 2013-01-01T10:00:00Z |
| 3 | 2013 | 1 | 1 | 544.0 | 545 | -1.0 | 1004.0 | 1022 | -18.0 | B6 | 725 | N804JB | JFK | BQN | 183.0 | 1576 | 5 | 45 | 2013-01-01T10:00:00Z |
| 4 | 2013 | 1 | 1 | 554.0 | 600 | -6.0 | 812.0 | 837 | -25.0 | DL | 461 | N668DN | LGA | ATL | 116.0 | 762 | 6 | 0 | 2013-01-01T11:00:00Z |
| 5 | 2013 | 1 | 1 | 554.0 | 558 | -4.0 | 740.0 | 728 | 12.0 | UA | 1696 | N39463 | EWR | ORD | 150.0 | 719 | 5 | 58 | 2013-01-01T10:00:00Z |
| 6 | 2013 | 1 | 1 | 555.0 | 600 | -5.0 | 913.0 | 854 | 19.0 | B6 | 507 | N516JB | EWR | FLL | 158.0 | 1065 | 6 | 0 | 2013-01-01T11:00:00Z |
| 7 | 2013 | 1 | 1 | 557.0 | 600 | -3.0 | 709.0 | 723 | -14.0 | EV | 5708 | N829AS | LGA | IAD | 53.0 | 229 | 6 | 0 | 2013-01-01T11:00:00Z |
| 8 | 2013 | 1 | 1 | 557.0 | 600 | -3.0 | 838.0 | 846 | -8.0 | B6 | 79 | N593JB | JFK | MCO | 140.0 | 944 | 6 | 0 | 2013-01-01T11:00:00Z |
| 9 | 2013 | 1 | 1 | 558.0 | 600 | -2.0 | 753.0 | 745 | 8.0 | AA | 301 | N3ALAA | LGA | ORD | 138.0 | 733 | 6 | 0 | 2013-01-01T11:00:00Z |
Last rows
| | year | month | day | dep_time | sched_dep_time | dep_delay | arr_time | sched_arr_time | arr_delay | carrier | flight | tailnum | origin | dest | air_time | distance | hour | minute | time_hour |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 336766 | 2013 | 9 | 30 | 2240.0 | 2250 | -10.0 | 2347.0 | 7 | -20.0 | B6 | 2002 | N281JB | JFK | BUF | 52.0 | 301 | 22 | 50 | 2013-10-01T02:00:00Z |
| 336767 | 2013 | 9 | 30 | 2241.0 | 2246 | -5.0 | 2345.0 | 1 | -16.0 | B6 | 486 | N346JB | JFK | ROC | 47.0 | 264 | 22 | 46 | 2013-10-01T02:00:00Z |
| 336768 | 2013 | 9 | 30 | 2307.0 | 2255 | 12.0 | 2359.0 | 2358 | 1.0 | B6 | 718 | N565JB | JFK | BOS | 33.0 | 187 | 22 | 55 | 2013-10-01T02:00:00Z |
| 336769 | 2013 | 9 | 30 | 2349.0 | 2359 | -10.0 | 325.0 | 350 | -25.0 | B6 | 745 | N516JB | JFK | PSE | 196.0 | 1617 | 23 | 59 | 2013-10-01T03:00:00Z |
| 336770 | 2013 | 9 | 30 | NaN | 1842 | NaN | NaN | 2019 | NaN | EV | 5274 | N740EV | LGA | BNA | NaN | 764 | 18 | 42 | 2013-09-30T22:00:00Z |
| 336771 | 2013 | 9 | 30 | NaN | 1455 | NaN | NaN | 1634 | NaN | 9E | 3393 | NaN | JFK | DCA | NaN | 213 | 14 | 55 | 2013-09-30T18:00:00Z |
| 336772 | 2013 | 9 | 30 | NaN | 2200 | NaN | NaN | 2312 | NaN | 9E | 3525 | NaN | LGA | SYR | NaN | 198 | 22 | 0 | 2013-10-01T02:00:00Z |
| 336773 | 2013 | 9 | 30 | NaN | 1210 | NaN | NaN | 1330 | NaN | MQ | 3461 | N535MQ | LGA | BNA | NaN | 764 | 12 | 10 | 2013-09-30T16:00:00Z |
| 336774 | 2013 | 9 | 30 | NaN | 1159 | NaN | NaN | 1344 | NaN | MQ | 3572 | N511MQ | LGA | CLE | NaN | 419 | 11 | 59 | 2013-09-30T15:00:00Z |
| 336775 | 2013 | 9 | 30 | NaN | 840 | NaN | NaN | 1020 | NaN | MQ | 3531 | N839MQ | LGA | RDU | NaN | 431 | 8 | 40 | 2013-09-30T12:00:00Z |
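The sample rows suggest that `hour` and `minute` are simply the two halves of `sched_dep_time` read as an HHMM integer, which would explain why `hour` is flagged as highly correlated with the time columns. The sketch below checks that reading against a few rows copied from the tables above; it is an observation about these samples, not a documented rule of the dataset, and `split_hhmm` is an illustrative name.

```python
def split_hhmm(sched_dep_time):
    """Split an HHMM integer such as 2250 into (hour, minute)."""
    return divmod(sched_dep_time, 100)

# (sched_dep_time, hour, minute) triples copied from the first and last rows.
samples = [
    (515, 5, 15),
    (529, 5, 29),
    (600, 6, 0),
    (2250, 22, 50),
    (2359, 23, 59),
    (840, 8, 40),
]

mismatches = [
    row for row in samples
    if split_hhmm(row[0]) != (row[1], row[2])
]
print("rows that break the HHMM reading:", mismatches)
```

If the relationship holds across the full dataset, `hour` and `minute` carry no information beyond `sched_dep_time`, which is worth keeping in mind before using all three as model features.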